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PREFACE 


This study is a contribution to research conducted by The RAND 
Corporation in the field of game theory. The formulation presented 


in this Memorandum has theoretical application in both military and 


civilian situations where searcher and evader tactics are involved. 











SUMMARY 


The solution of the following hide and seek game is presented: 
At each move Player I, the evader, is allowed to hide in one room, 
while Player II, the searcher, is allowed to search some given number 
of rooms, The restriction is made that Player II searches without 
repetition, that is, he is never allowed to return to a room he has 
previously searched, It is shown that if the payoff to Player I is 
any increasing function of the number of moves before capture, his 
best strategy is also never to return to a room in which he has previously 


hidden. A formula for the value of the game is presented. 
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INTRODUCTION 

The common situation of search versus evasion can be described 
precisely in the language of game theory: Player I, the evader, and 
Player II, the searcher, each choose an integer from 1 through N (each 
integer representing one of N separate rooms). This paaa is repeated 
until either a match occurs (i.e., II finds I) or the game has run some 
preselected number n of moves. In the first case there is no payoff, 
while in the second case II pays I one unit. If no restrictions are 
placed on the strategies of the players the solution of this game is 
trivial--each player chooses randomly with equal probability from the 
complete set {1, 2, ... N} at each move, and the value of the game is 
(1-1/N)". 

This paper investigates the solution of the same game under the 
restriction that II's search be systematic--after Player II visits a 
room he may not return to that room until he has searched through the 
N-1 others. It is natural in this case to insist that n be equal to or 
less than N. II's possible pure strategies are represented by the per- 
mutations of n out of the N integers. (A permutation is defined as an ordered 
set without repetition.) From the Symmetry it is easily seen that II's 
optimum strategy is to choose the (N], = N) (N- 1) (N- 2)... 

(N - n + 1) permutations randomly and with equal probability. 

The main result of this paper is a proof that under these circum- 
stances I's optimum strategy is exactly the same as II's; I randomly 
chooses n different integers, never making use of his option to return 
to a room he has already visited. A simple expression for the value of 


the game is derived, also yielding the expected payoff for any other 











strategy Player I may choose. The theorems are actually proved in much 





greater generality--when II is allowed to search more than one room per 
move, when the payoff to I is any increasing function of his survival 
time, and conditionally, when for any reason another rule has been | 
followed up to a certain point. The universality of the "no return 
rule" is of course very useful to Player I, who can play optimally with 
practically no information on the actual structure of the game. 

The condition of systematic search is often encountered in practice, 
either because of ignorance, physical inconvenience, or ulterior motives 
on the searcher's part. The watchman, the detective, and the air recon- 
naissance team anxious to cover as much ground as possible are all likely 
to search non-repetitively. If the evader has no additional information 
on the searcher's pattern, then it is sensible for him to play the "no 
return rule". For in this case the searcher's optimum strategy, all 
permutations equally likely, coincides with the natural "ignorance" 


distribution. 


FORMAL DEFINITIONS AND A STATEMENT OF THE THEOREMS 
In the most general situation discussed in this paper Player II 
will be allowed to search M rooms on his first move, M, on the second, 


... and Ma on the nth, where the M; are fixed constants such that 
= < N. 
M at Moe + Mi No N i 
(He searches without repetition, as explained in the introduction.) 
Player I is allowed to hide in only one room per move but he may return 


to rooms he has already visited if he so desires. If Player II first 


finds Player I on the j,th move, I receives a payoff of P(3,) units, 








(Pn+1) being the payoff if no detection occurs, P(1) < P(2) -++-*<s P(nt1). 


Formally we denote II's selection of No different rooms by 


> 


(r11 r12 TT FM? r21 eee 2M? at Yal eee Pon? 


-a permutation of size No = M +M i T Ma from the first N positive 


2 


integers. Player I selects an n-tuple 


Bis Ry ie R) 


from the first N positive integers, repetitions being allowed. Ea is 


E) the rooms searched on 


Player L's hidin lace and fr, t bă 
y g P C Eg m 


the jth move. If Jo is the smallest value of j such that 


r eee F 


R. € 
ri j2 m) 


J 


Player II pays Player I Pj) units. If there is no such value of j, 


I receives P(n+1) units, where the P(j) form a non-decreasing sequence. 


This situation will be referred to as the general case. 


When 
Mi = M, = M, a = a =M 
and 
P(1) = P(2) ... = P(n) = 0 
P(n+1) = 1, 


we shall say we are in the special case. (The special case with M = 1 
the game discussed in the second paragraph of the introduction.) 
Assume first that Player II plays "random permutations,'” that is 


II chooses among the Un, possible permutations randomly and with 
0 








win 


equal probability. The main conclusion of this paper is that Player I's 
best counter-strategy is then also to play random permutations; Player I 
selects the n-tuple (R, Ree RG) randomly and with equal probability from 
the lv, possible permutations of n out of the N integers. But when I 
plays in this manner, any strategy II may choose yields the same expected 
payoff, which implies that random permutations is the optimum strategy 

for both players. In the sequel it is always assumed that Player II, 


but not necessarily Player I, behaves in this way. 


Define R(n, ,ny-- +n) as the set of n-tuples (RiR RD) involving 


k distinct integers with frequencies Ny rar. ny. respectively, where 
necessarily 
non tn a 
1-2 k 


Each element of R(n, ,n,+...n,) is a pure strategy for Player I. It is 


important to note that in the special case, all of these pure strategies 
(and hence any probabilistic mixture of them) give Player I the same pro- 
bability of escaping detection for the duration of the game when Player 
II is playing random permutations. We denote this probability by 


vía, Dn, + 4 ma) 


For convenience we call the set of all mixed strategies for Player I 


involving only the elements of R(n M+ + om) "the strategy S(n,; ny see ny)" 


(or, if no confusion is possible, simply "the strategy S".) Thus 
V(n,,n,-..n) æ V(S) 


is the expected payoff versus random permutations of strategy 
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S(n,,n,...n) EZ S 
in the special case. 


(As a specific example of these definitions let N=8, n=3, n,=1, 


1 


n,=2. Then (R R, R3) = (2,7,7) and (R, -R3 R3) = (3,5,3) are both 


elements of R(n} ,n,). The pure strategy for player I "room 2 first 
move, room 7 second move, room 7 third move” is a member of S(n,,n.), 


as is the mixed strategy '20% of the time room 2 on the first move, 
room 7 on the second, room 7 on the third; 80% of the time room 3 on 
the first move, room 5 on the second, room 3 on the third.") 


For the purpose of evaluating V(n, ong... en) we can use any ele- 


ment of R(n,,n,-..n,) to realize S(n¡,n,...n). A particularly useful 
choice will be 


(R, »R, Ro sand E 


Given 
SE S(n,n,...n,-..n_ ...n,), 
< 
where na Sa) 


the two strategies 


4 
S° z= S(n,,n,-.-n, yn, + A 
and ; 


a” 
S z S (njon e nt Pan ERa ++. Ms 1) 


are both said to be simply derived from S. S, is derived from S, if 


it can be attained from 5, by a sequence of simple derivations. 


Theorem 1 (special case): If S, is derived from S, then v(S,) > v(S,). 


AAA A A EE A A O A A PA SE TE 
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Among all strategies s(n,,n,---n,) such that ny + Ny sss + nm, =n, the 


“no return" rule $= S(1,1...1) maximizes V(S). 
AA A 


n 
Define 
s’ (ny, ny ces ny.) < S(n,; Dn)... ny) 
k< k 
and m =n, 1212 E 


A representative realization of S is the permutation 


(By Myon B AL 2 oBu MEAT ARTEA aléalo ck +1,...k +l...k,k...k) 
See ee a A aM AAA aa A 


O. A dE ny 
which shows that Player I may think of playing strategy S* first and then, 
if he is not detected by Player IL, completing it to strategy S. The con- 
ditional probability that Player I will escape detection playing strategy 
S, given that he has already played S' without detection is defined as 
v(S | S'). (A formal definition of the conditional expectation is post- 


poned until the next section.) 


Theorem 2 (special case): Suppose six Si> Sov 57> and S, is derived 





. = ' t 1 
from S}. Then v(s, | $2 v(s, | 5). Given any strategy S, = S(n;» ngs D kt)? 








where 
nj + n, + mei = a! = ñ, 
Let 8, = § (n;> Ty oes Ty Ls ie 
n-n! 


Then v(S, | 5, ) = max V(S | S.) among all strategies S(n,,n,,-.-1,) such 





that S| < S and n) + n, Eo antro SE nm, =n. (In other words, Player I should 





begin playing the "no return rule” as soon as he is allowed to do so.) 











Given strategy S(n, ,2),+--1,), define 


Q 
ii 


k-1 k 
E OD. cum 


i=1 j=i+1 


(the at are the elementary symetric functions of Ny »M,---n, +) 


Value Theorem (Special Case) 


Mo 2c (-M) Ko 


V(n, ,n ye le wee 4 E ón do . 
y + ENT, [N], N 


In particular for 





S = S(l,1...1), 


n 
n 


h 
ves) 2 CAE 
a 


which is the value of the game (attained when both players play random 
af a a Aa = E A naib 


permutations.) 


Theorem 3 (general case) 


Random permutations is the optimum strategy for both players. 

More elaborate results are obtainable in the general case, but they 
will not be investigated here. It should be noted that theorem 3 holds 
true regardless of the constants MM, M» and hence they need not be 


assumed known to Player I. 
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PROOFS 
The second halves of Theorems 1 and 2 are simple consequences of 


the first statements. To prove these it is sufficient to consider only 


the subcase M = 1, since the strategy 
S = S(n,; Ny ... ny) 
for general M is equivalent to the strategy 


* 
S = S(n,M, nM, State nM) 


for M= 1. Furthermore, using the obvious notation, if S3 is derived 


* * * * 
from S, then 5, is derived from $}, and if Sows then So < Ss. 


In the special subcase with M = 1 we change notation slightly and 
denote Player II's randomly selected permutation by 
(r(1), r(2) ... r(n)). 
en) for Player I can be 


as commented previously, a strategy S(n; >03- 


realized by the n-tuple 


(R, »R,++-Ry) a (l,Lar «lo 2d e vax k,k...k). 


q k Pk 


Define 
i-1 i-1 i-1 
A; = {ra+ A nj), r(2+ ) n,) serm + ) n;)} 1 =-1,Ze..E 
j=1 jel jel 
A, 20 i>k. 


A; is the set of rooms Player II searches while Player I is hiding in 


room i under the strategy (Ry »Ro +++ +R) above. In what follows it is 


assumed that 1 plays this fixed pure strategy, while II plays random 








aie 


permutations. Then the sets A; are functions of the random permutation 


SS Glide FE hv es « r(n) ), and the event of no capture, 


r(j) # R, S ee A n 


is equivalent to the event 


fdo 11,2... k, 


Main Lemma 


k 
Let p, PREP a | 1f4,1=1,2 ... E) 


h=1,2... Ne 
Then if M20, 2...2 Mm; 


< = = ees = 
P1 = Po = oar) Pel Pk+2 Pa’ 


Proof: Set n * 0 for h > k and suppose that for some m and h, lai > n. 


Define 
m-1 m-1 m-1 
B = {r(1+ Fai r(2+ 3 H) ece tn n + ) n,)} 
j=l d jal j m g j=l J 
and A sA -B 
m m 
A, = A, fo E 
Then 


Ph «PABLO TALE k= 
=P(h€B,mf£B,1fA, i= 1,2... k) 


k 
+P(hf B, mf B, he U Ay, i dA, i= 1,2 awe k) 


i=1 
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Pa PCL A, i= 1,2 ... k) 
y k 
= P(heB,mé¿B, i éA; i = 1,2...k, me UA,) 
k a 
+P(h dB, m¢dB,me U Aj, i A; i = Í;2 s.: kĵo 
i=1 


The second terms on the right sides of these two equations are equal by 





symmetry, and comparing the first terms verifies the lemma.’ 


Given 
S' (nj >» ny . ngs) < S(nj; n, +... n) 
define 
iti ial 1-1 
> / 
A; = {r+ $ nj, r(2+ ) n,) ee (nf + ) a} 16 A i 
j=l j=1 j=l 


The value and conditional value are formally defined by 
v(s) = Ven oes n,) =P(if Ay, i = 1,2 ... k) 


v(s | S') s P( fA, i= 1,2 ...k | if A; i=1,2 ... k’). 


Finally, for <n (n, may equal zero), let 
Ph l Pa h 


wn 
fu 


S(n, ny ree DL eee na l ien n,) 


= S(n, ,n, see Mo... Do... n,) 


1 
S, = S(n, »n, see Th Al na sl sss n)- 

Corollary vis, | sy) > ves, | s,). 

Proof: v(S, | So) = 1- (1 - Pi 


N-n+1 


ves, | So) a Ln (1 = Ph) 
N-n+1 








Siis 


Applying the lemma proves the corollary. 
Proof of Theorems 1 and 2 


Applying the definitions, if s/ <s then 


vas) = vas”) ws | s^. 
Thus the corollary implies that V(S,) > v(S,), and iterating this result 
proves Theorem 1. The equality above also shows that Theorem 1 implies 
Theorem 2. 


Define 


t, = {(rQ), r@) ... x) | i ea, fitz... k 


(the elements of T, are permutations of size n from the first N positive 


integers) so that the event 


i f A, i = 1,2 ... k 
L 
is equivalent to the event de 
(00, r(2) ... r(n)) $ U Ti 
i=1 
By direct evaluation 
Mm, Ry vee A; 
P (T T «de Ti j = l a B 
1 2 h 


Applying a well-known Formia yields the value theorem for M = 1, 


and the remark at the beginning of this section extends the result 
to general M. 


In the general case let R> R, see Rp) be any element of 


2 


R(n;> O, «ss n)» m any integer that appears more than once in 


2 


(By > Rs see RD? and h the largest integer such that 


2 


Ra = me 
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It is a simple consequence of the main lemma that the n-tuple 


(Ryo Ry vee Royo Ros Baya os Ry) 


yields a greater expected payoff than (R; > R, sen Rp versus random 


permutations whenever 


R/ t {R> pe R,} - 


Theorem 3 follows by induction. 
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